A corpus-based approach for automated LOINC mapping

Authors

  • Mustafa Fidahussein
  • Daniel J. Vreeman
Abstract

OBJECTIVE To determine whether the knowledge contained in a rich corpus of local terms mapped to LOINC (Logical Observation Identifiers Names and Codes) could be leveraged to help map local terms from other institutions. METHODS We developed two models to test our hypothesis. The first, based on supervised machine learning, was created using Apache's OpenNLP Maxent; the second, based on information retrieval, was created using Apache's Lucene. The models were validated by a random subsampling method that was repeated 20 times and used 80/20 splits for training and testing, respectively. We also evaluated the performance of these models on all laboratory terms from three test institutions. RESULTS Across the 20 validation iterations with 80/20 splits, Maxent ranked the correct LOINC code first for between 70.5% and 71.4% of local terms, and Lucene for between 63.7% and 65.0%. For all laboratory terms from the three test institutions, Maxent ranked the correct LOINC code first for between 73.5% and 84.6% (mean 78.9%) of local terms, whereas Lucene's performance was between 66.5% and 76.6% (mean 71.9%). Using a cut-off score of 0.46, Maxent always ranked the correct LOINC code first for over 57% of local terms. CONCLUSIONS This study showed that a rich corpus of local terms mapped to LOINC contains collective knowledge that can help map terms from other institutions. Using freely available software tools, we developed a data-driven automated approach that operates on term descriptions from existing mappings in the corpus. Accurate and efficient automated mapping methods can help to accelerate adoption of vocabulary standards and promote widespread health information exchange.
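
The validation scheme described in the abstract (random subsampling repeated 20 times with 80/20 splits, scored by how often the correct code is ranked first) can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the authors' implementation: a simple token-overlap scorer stands in for the Maxent and Lucene models, and the local terms and code labels in `TOY_CORPUS` are hypothetical placeholders rather than real LOINC codes.

```python
import random
from collections import Counter, defaultdict

# Hypothetical (local term, code label) pairs; labels are placeholders, not LOINC codes.
TOY_CORPUS = [
    ("glucose serum", "CODE_GLUCOSE"),
    ("glucose plasma fasting", "CODE_GLUCOSE"),
    ("glu ser", "CODE_GLUCOSE"),
    ("sodium serum", "CODE_SODIUM"),
    ("sodium plasma", "CODE_SODIUM"),
    ("na serum", "CODE_SODIUM"),
    ("hemoglobin blood", "CODE_HGB"),
    ("hgb blood", "CODE_HGB"),
    ("hemoglobin whole blood", "CODE_HGB"),
    ("potassium serum", "CODE_POTASSIUM"),
]


def tokenize(text):
    """Lowercase and split a term description on whitespace."""
    return text.lower().split()


def train(pairs):
    """Collect token counts per code from already-mapped terms."""
    index = defaultdict(Counter)
    for term, code in pairs:
        index[code].update(tokenize(term))
    return index


def rank(index, term):
    """Return candidate codes ordered by token-overlap score, best first."""
    tokens = tokenize(term)
    scores = {}
    for code, counts in index.items():
        total = sum(counts.values()) or 1  # guard against empty descriptions
        scores[code] = sum(counts[t] for t in tokens) / total
    # Ties are broken alphabetically so the ranking is deterministic.
    return sorted(scores, key=lambda c: (-scores[c], c))


def top1_accuracy(index, test_pairs):
    """Fraction of test terms whose correct code is ranked first."""
    hits = sum(1 for term, code in test_pairs if rank(index, term)[0] == code)
    return hits / len(test_pairs)


def random_subsampling(pairs, iterations=20, train_fraction=0.8, seed=0):
    """Repeat a random train/test split and record top-1 accuracy each time."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(iterations):
        shuffled = pairs[:]
        rng.shuffle(shuffled)
        cut = int(len(shuffled) * train_fraction)
        index = train(shuffled[:cut])
        accuracies.append(top1_accuracy(index, shuffled[cut:]))
    return accuracies


accuracies = random_subsampling(TOY_CORPUS)
print(min(accuracies), max(accuracies))
```

Reporting the minimum and maximum over the iterations mirrors the "between X% and Y%" ranges quoted in the results. A cut-off like the 0.46 mentioned for Maxent would be applied to the top candidate's score to decide whether to accept it automatically; that score is model-specific and is not reproduced here.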


Similar resources

A method for the automated mapping of laboratory results to LOINC

LOINC is emerging as the standard for laboratory result names, and there is great interest in mapping legacy terms from laboratory systems to it. However, the mapping task is non-trivial, requiring significant resource commitment and a good understanding of the LOINC identifying attributes for the laboratory result names. Because the number of results in a laboratory system may range from aroun...


Application of a Regenstrief RELMA V.6.6 to Map Russian Laboratory Terms to LOINC.

BACKGROUND Manual mapping of laboratory data to Logical Observation Identifiers Names and Codes (LOINC) requires a major effort. Application of the LOINC mapping assistant RELMA V.6.6 can reduce the effort required for mapping. The goal of the paper is to perform a semi-automated mapping of Russian laboratory terms to LOINC. METHODS A semi-automated mapping of the 2563 terms from two clinics ...


Learning from the crowd while mapping to LOINC

OBJECTIVE To describe the perspectives of Regenstrief LOINC Mapping Assistant (RELMA) users before and after the deployment of Community Mapping features, characterize the usage of these new features, and analyze the quality of mappings submitted to the community mapping repository. METHODS We evaluated Logical Observation Identifiers Names and Codes (LOINC) community members' perceptions abo...


Proposed Algorithm with Standard Terminologies (SNOMED and CPT) for Automated Generation of Medical Bills for Laboratory Tests

OBJECTIVES In this study, we proposed an algorithm for mapping standard terminologies for the automated generation of medical bills. As the Korean and American structures of health insurance claim codes for laboratory tests are similar, we used Current Procedural Terminology (CPT) instead of the Korean health insurance code set due to the advantages of mapping in the English language. METHODS...


The Map to LOINC Project

We describe a pilot project to standardize local laboratory test names to Logical Observation Identifiers Names and Codes (LOINC) at five Indian Health Service (IHS) medical facilities. An automated mapping tool was developed to assign LOINC codes. The laboratory test names not mapped to LOINC by the mapping tool were assigned LOINC codes manually. The results achieved matched current benchmarks.



Journal:
  • Journal of the American Medical Informatics Association : JAMIA

Volume 21, Issue 1


Publication date: 2014